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Abstract 

The search for generally applicable methods in swarm intelligence aims 
to gain new insights about natural swarms and to develop design method- 
ologies for artificial swarms. The ideal would be a 'swarm calculus' that 
allows to calculate key features such as expected swarm performance and 
robustness on the basis of a few parameters. A path towards this ideal is 
to find methods and models that have maximal generality. We report two 
models that might be examples of exceptional generality. First, we present 
an abstract model that describes the performance of a swarm depending 
on the swarm density based on the dichotomy between cooperation and 
interference. Several examples are given how the model fits to results 
from typical swarm experiments. Second, we give an abstract model of 
collective decision making that is inspired by urn models. A parameter, 
that controls the feedback based on the current consensus, allows to un- 
derstand the effects of an increasing probability for positive feedback over 
time in a decision making system. Several applicable methods, such as the 
description as Markov process, calculation of splitting probabilities, mean 
first passage times, and measurements of positive feedback, are discussed 
and applications to artificial and natural swarms are reported. 



1 Introduction 

Research in the context of swarm intelligence is important in biology to gain 
new insights about natural swarms and also in fields aiming for artificial swarms, 
such as swarm robotics, to obtain sophisticated design methodologies. Ideal 
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tools would allow to calculate fundamental features of swarm behavior, such 
as performance, stability, and robustness, based on an input of a few observed 
parameters in the case of natural swarm systems or a few designed parameters 
in the case of artificial swarms. We call this highly desired set of tools 'swarm 
calculus' (calculus in its general sense). The idea is to establish a set of math- 
ematical tools that can easily be applied to a variety of settings in the field 
of swarm intelligence and that can be combined in many ways by using them 
as mathematical building blocks for modeling. Thus, models will surely be an 
important part of swarm calculus. 

In order to define a general methodology of understanding and designing 
swarm systems, general properties and generally applicable models need to be 
found. Today only few models exist that have the potential to become gen- 
eral swarm mode ls. Biological swarm m o dels are parti c ularly distinguished 
by their variety (Okubo and Levin . 2001 ; Okubol. 1986 ; IVicsek and Zafeirisl 
20121 lEdelstein-Keshetl 120061 ICamazine et all l200lh . Typically each biolog- 
ical challenge is answered by a specialized model. The desire for a model 
with applicability to all natural swarms seems to be small in that commu- 
nity. In the field of artificial swarms, such as robot swarms, the desire for 
generality seems to be bigger which is, for example, expressed by severa l mod 
els in swarm robotics (IHamann . l2010t iBerman et all 1201 It iProrok et all 12011 



Milutinovic and Lima , 2007t Lerman et al. , 20051) . The idea of these models is 



to support the design of swarm robotic systems within a maximal range of appli- 
cations. They focus on features that describe quantities of the swarm behavior, 
such as robot distributions or required times for certain tasks and typically 
struggle between the intended generality and having direct connections between 
the model and the actual description of the individual robot's behavior. If 
we abandon, however, the demand for a detailed description of behavioral fea- 
tures and focus only on high-level features such as overall performance or the 
macroscopic process of a collective decision then a higher degree of generality is 
achievable. In addition, that way it is possible to define concise mathematical 
descriptions that, in turn, allow applications of standard methods from statis- 
tics, linear algebra, statistical mechanics etc. In this paper, we identify two 
models of universal properties of swarm systems concerning the dependence of 
swarm performance on swarm density and the dependence of collective decisions 
on positive feedback. Next we give an introduction to each of the two fields. 



1.1 Swarm performance 

By 'swarm performance' we denote the efficiency of a swarm in a certain task. 
For example, the swarm performance can be a success rate of how often the 
task is accomplished in average, it can be the average speed of a swarm in 
collective motion etc. Here we are interested in the swarm performance as a 
function over 'swarm density' which is how many agents are found in average 
within a certain area. For the following reason the function of swarm perfor- 
mance depending on swarm density cannot simply be a linear function. For a 
true swarm scenario a very low density (e.g., corresponding to one agent in the 
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whole area) has to result in low performance because there is neither coopera- 
tion nor a speed-up. With increasing density the performance increases because 
of a simple speed-up (e.g., two robots clean the floor faster than one) and be- 
cause of increasing opportunities of cooperation (assuming that cooperation is 
an essential beneficial part of swarms). In natural swarms this performance 
increase with increasing swarm size is revealed in increased productivity and 
the eme rgence of increased div i sion of labor as an indicator for increased coop- 
eratio n ( Jeanne and Nordheim , 1996 ; Karsai and Wenzel , 19981: iGautrais et al. , 
2002t iJeanson et all 12007). Even a superlin ear performance increase is pos- 
sible in this interval (IMondada et all 120051) . At some critical/optimal den- 



sity (Schncider-Fontan and Mataric, 199 1 



off and then decrease ( Arkin et al 




the performance curve will first level 
because improvements in coopera- 
tion possibil ities will be lower than the drawback of high densities, namely 



interference ( Lerman and Galstvan , 20021) . With further increase of th e density 
the performance continues to decrease (jGoldberg and Mataric . 19971) . Hence, 
swarms generally face a tradeoff between beneficial cooperation and obstruc- 
tive interference. However, it is to be noted that interference has also positive 
effects both within certain swarms and as a tool fo r desig ning swarm algo- 



rithms (jDussutour et all 12004 iGoldberg and Matarid 1199 



:sigr 



It turns out that not only these qualitative properties are similar in many 
swarm systems but also the actual shapes of swarm performance over swarm 



size/density graphs (see Fig.|2(a) ). Examples are the performance of forag ing in 



a group of robots (Fig. |2(b)] and Fig. 10a in (jLerman and Galstvanl . I2002D ). the 
activation dynamics and information ca pacity in an ab stract cellular automaton 
model of ants (Figs, lb and lc in (Miramontes . 19951 )). and even in the sizes of 
social networks (Fig. 8b in ( StrogatzT 200lh ). A similar curve is also presented 
as a h ypothesis for per capita output in social wasps by Ijeanne and Nordheim 
(|l996l) . furthermore, notice that these shapes are typical in probability distri- 
butions such as Weibull, Wigne r, and log-normal . The e xistence of this general 
shape was already reported by l0stergaard et aL ( 2001 ) in their expected per- 



formance model for multi-robot systems in constrained environments: 

We know that by varying the implementation of a given task, we 
can move the point of "maximum performance" and we can change 
the shapes of the curve on either side of it, but we cannot change 
the general shape of the graph. 

A related set of models are traffic models of flow ov e r den sity. The 'fundamen- 
tal diagram' of traffic flow (ILighthill and Whithaml . Il955l) is symmetric, more 
realistic models propose at least two asymmetric phases of free and synchro- 
nized flow. Actual measurements on highways show curves with shapes similar 



to Fig. 2(a) (e.g., see Fig. 6-4 in ( Mahmassani et all . 2009)). In these models 



there exist two densities for a given flow (except for maximum flow) similar 
to the situation here where we have two swarm densities for a given swarm 
performance. 
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1.2 Collective decisions 



In the context of swarm intelligence, collective decision-making is a process that 
is distributed among a group of agents without global knowledge. Each agent 
decides based on locally sampled data such as the current decision of its neigh- 
bors. There are many biological systems showi ng collective decisio n- making, 



for example, food source choic e in ho ney bees (jSeelev et all 119911 ). nest site 



ders (Saffre et al 



selection in ants (Mallon et all 120011 ). and escape route search in social spi- 



19991 ). Collective decision-making systems are often modeled 



as positive-feedback systems that utilize initial fluctu ations which are amplified 
and that way help to converge to a gl obal decision (jDeneubourg et al. . 119901: 



Mallon et all l2001t iNicolis et all 120111 ) . Interesting features of collectiv e deci- 



sions are, for exam ple, the speed-ac c uracy trade-off (INicolis et al. . 2011 ) or the 
influence of noise ( Dussutour et al. . 120091 : lYates et all 2009 ). Furthermore, it 



turns out that posit ive feedback is not a lways productive but can also generate 

" 20111) . 



irrational decisions (Nicolis et al 



In the following we limit our investigations to binary decision processes be- 
cause they allow for a concise mathematical notation and hence allow an easy 
application of mathematical methods. The investigated systems are either in- 
herently noisy (e.g., explicit stochastic processes within the agents' behaviors) 
or can validly modeled as noisy processes (e.g., deterministic chaos with strong 
dependence on the initial conditions). We are not interested in the quality of 
the final decision, that is the utility of choosing option A over option B or vice 
versa, and assume that there is no initial bias to one or the other. In selecting an 
appropriate model for collective decisions our main concern is simplicity while 
keeping attention on the relation of how much is needed as input to the model 
and how much is generated by it. We want to keep the number of parameters 
small while achieving descriptions of qualitative aspects of collective decisions 
as effects of the model. In order to obtain these standards we choose a mini- 
mal macroscopic model which has only one state variable describing the current 
status of the collective decision within the swarm (e.g., 80% for option A and 
consequently 20% for option B). 

For simplicity we view the asynchronous distributed process of collective de- 
cisions as a round-based game which allows only one agent at a time to either 
revise its current decision or to convince a peer to revise its current decision. 
For this purpose we re-interpret well-known urn models as models of collec- 
tive decisions and ex tend them appropriately . We u se simple models inspired 
by the urn model o f Ehrenfest and Ehrenfestl (1907) and by the urn model of 
Eigen and Winklerl (|l993f ). 



1.2.1 Ehrenfest urn model 

This urn model was originally introduced bv lEhrenfest and Ehrenfe"st ( 1907 ) in 
the context of dissipation, thermodynamics, statistical mechanics, and entropy. 
The dynamics of the model is defined as follows. Our urn is filled with N mar- 
bles. Say initially all marbles are blue. Whenever we draw a blue one, which 
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is for sure in the first round, we replace it with a red marble. If we draw a red 
marble we replace it with a blue one. Obviously the two extreme states of either 
having an urn full of blue marbles or an urn full of red marbles are unstable. 
Similarly this is true for all states of unevenly distributed colors. To formalize 
this process we keep track of how the number of blue marbles (w.l.o.g.) changes 
depending on how many blue marbles were in the urn at that time. We can 
do this empirically or we can actually compute the average expected 'gain' in 
terms of blue marbles. For example, say at time t we have Bit) = 16 blue 
marbles in the urn and a total of N = 64 marbles. The probability of drawing 
a blue marble is therefore Pb = gf = 0-25. The case of drawing a blue marble 
has to be weighted by —1 because that is the change in terms of blue mar- 
bles in that case. The probability of drawing a red marble is Pr = || = 0.75 
which is weighted by +1. Hence, the expected average change AB of blue mar- 
bles per round depending on the current number of blue marbles B — 16 is 
AB(B = 16) = 0.25(-l) + 0.75(+l) = 0.5. This can be done for all possible 
states yielding 

AB(B) = -2^ + l (1) 



which is plotted in Fig. 1(a) Hence, the average dynamics of this game is given 
by B(t + 1) = B(t) + AB(B(t)). 

The recurrence B t = Bt-i + 1 — jjjj-Bt-i can be solved by generating func- 
tions (jGraham et all Il998f) . For Bq = we obtain the generating function 

G(*) = Efe(£VV ( 2 ) 




The tth coefficient [z*] of this power series is the closed form for B t . We get 

k<t \ J 1 64 

Hence, for initializations Bq = and the symmetrical case Bq = 64 the 
system converges in average rather fast to the equilibrium B = 32. The actual 
dynamics of this game is, of course, a stochastic process which can, for example, 
be modeled by B(t + 1) = B(t) + AB(B{t)) + £(t), for a noise t erm £. 



S everal genera lizations have been proposed for this urn model ( Kra fft and Schaefer 



19931 : iKleinL [19561 . however, mostly these investigations were focused on math- 



ematically tractable variants. In the following we report generalizations which 
are focused on applications to feedback processes. 

1.2.2 Eigen urn model 

The Ehrenfest model can be interpreted as an example of a negative feedback 
process. Deviations from the fixed point B = 32 are correct ed by negative feed- 



back (the predominant color will be diminished in average) . lEigen and Winkler 
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32 64 32 

B B 

(a) Ehrenfest urn model (b) Eigen urn model 



Figure 1: Expected average change in the urn models. 



( 1993) reported a similar urn model to show the effect of positive feedback. In 
this model drawing a blue marble has the effect of replacing a red marble by 
a blue one and vice versa. The expected average change of blue marbles per 
round changes accordingly to 

ABwJf- 1 ' ^M*. (4) 
0, else 



for a plot see Fig. 1(b) 



While we still have an expected change of AB = for B = 32 as in the 
Ehrenfest model, the fixed point B = 32 is unstable now as its surrounding 
drives trajectories away from B = 32 and towards the stable fixed points B = 
and B = 64 respectively. In Sec. [3] we introduce a more general urn model that 
takes the intensity of positive feedback as a parameter and which can be used 
to model collective decisions in swarms. 



2 Universal properties of swarm performance 

Having identified the two main components (cooperation and interference) and 
the typical shape of these graphs we can define a simple model. The idea is to 
fit this model to empirical data for verification and predictions. 



2.1 Simple model of swarm performance 

For a given bounded, constant area A the swarm density p is defined by the 
swarm size N according to p = N/A. We define the swarm performance P 
depending on swarm size N by 

P(N) = C(N)(I{N) -d) = aiiV fc a 2 exp(cTV), (5) 



for parameters c < 0, 01,03,6 > 0, and d > (see Fig. 2(a) I. Parameter d is 
subtracted to force a decrease to zero (limAr^oo I(N) — d = 0). The swarm per- 
formance depends on two components C and /. First, the swarm effort without 
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negative feedback is defined by the cooperation function (see also Fig. 



2(a)) 



C(N) = ai N b 



(G) 



This function can be interpreted as the potential for cooperation in a swarm 
that would exist without certain constraints, such as p hysical c ollisio ns or other 
spatial restrictions. The same formul a was used by iBrederl (11954 to model 
the cohesiveness of a fish school and by iBierknes and Winfieldl (|2010h to model 
swarm velocity in emergent taxis. However, they used parameters of b < 1 while 
we are using mostly values of b > 1. Second, the interference function (see also 



Fig. 2(a) ) is defined by 



I(N) = a 2 exp(ciV) 



(7) 



with d used for scaling (e.g., Yithn-^oo I{N) — d). The interference function 
can also be interpreted as the swarm performance achievable without cooper- 
ation, that is, achievable swarm performance without positive feedback. The 
expone ntial decrease seems to be a re asonable choice, for example, compare Fig. 
10b in (jLerman and Galstvan . 2002 ) which shows an exponentially decreasing 
efficiency per robot in a foraging task. 



2.2 Examples 

To prove the wide applicability of this simple model we fit it to some swarm 
performance plots that were available. We investiga te four scenarios: forag- 
ing in a group of robots (Lerman and Galstvan . 20021). collective decision mak - 
ing (|Hamann et al.l . l2012t ) based on BEECLUST (jSchmickl and HamannLl201ll) 
aggre gations in t ree- like structures and redu ction to shortest paths (jHamann . 
l2006f ) similar to ( Hamann and Worn . 2008), and the emergen t taxis scenario 



(also s ometimes called 'alpha algorithm'. iNembrini et al.l (|2002l ): IBierknes et al 
l|2007f n. 

Given the data of the overall performance, the four parameters of eq. [5] can 
be directly fitted. This was done for the first three of the four examples in 
Fig. [21 The equation can be well fitted to the empirical data. In case of the 



foraging scenario (Fig. 2(b)) we also have data of the efficiency per robot. We 
can use the model parameters, that were obtained by fitting the model to the 
overall performance, to predict the efficiency per robot which is a function that 
we suppose to be proportional to effects by interference. This is done by scaling 
the interference function linearly and plotting it against the efficiency per robot. 



The satisfying result is shown in Fig. 2(b) 



We analyze the forth example, emergent taxis, in more detail. The following 
empirical data is based on a simple simulation. This simulation is noise-free 
and therefore robots move in straight lines except for u-turns according to the 
emergent taxis algorithm. First, we measure the performance that is achieved 
without cooperation. This is done by defining a random behavior that ignores 
any characteristic feature of the actual emergent taxis algorithm. For example, 
in the emergent taxis algorithm, robots count the number of neighbors and do 
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Po.25 



! cooperation 

swarm performance 




(a) Model of cooperation and interference, 
examples of swarm performance (eq.0, co- 
operation (eq. [B}, and interference (eg. [7t 
depending on swarm size. 



group efficiency 



P 0.004 




(b) Foraging in a group of robots, eq. [5] fit- 
ted to group efficiency (upper solid line), pre- 
diction of interference (lower solid line, ef- 
ficiency per robot, linearly rescaled), data 
points extracted from Fig. 10 i n Lcrman 
et al. l lLerman and GalstvarJ . I2002T ); P gives 
group/robot efficiency 



Po.5 



Po 



1 1 1 

50 100 

N 

(c) Collective decisio n mak- 

ing jHamann et al| l2012h based on 
BEE CLUST ilSchmickl and Hamannl , 

l201lf ) ; values of P = 1 would indicate that 
100% of the swarm have found a decision; 
P = indicates symmetry between options 



' 1 J 

0.025 0.05 

P 

(d) Aggregation in tree-lik e structures an d 
reduction to shortest path l|Hamannl . [2006); 
P gives the ratio of successful runs 



Figure 2: Model of cooperation and interference and three scenarios with fitted 
performance P according to eq. [5] 
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N 

(a) Model fitted to data of random and 
emergent-taxis behavior; P is covered dis- 
tance by barycenter times N. 



0.006 
0.004 
v 0.002 


-0.002 

10 20 30 40 

N 

(b) Histogram of barycenter speeds v for differ- 
ent swarm sizes. Note the bimodality about in 
the interval N 6 [16,40]. 



Figure 3: Performance of a random behavior and the actual self -organized emer- 



gent t axis behavior (also so metimes called 'alpha algorithm') (jNembrini et al 



2002t iBierknes et all 120071 ) with fitted model (eq. [5]); histogram showing two 
phases. 



u-turns if this number drops below a threshold a. To obtain the cooperation- 
free behavior we have set this parameter to a = in the simulation. Hence, no 
robot will ever u-turn and they basically disperse in the arena. A simulation 
run is stopped once a robot touches a wall. The performance of the swarm P is 
measured by the total distance covered by the swarm's barycenter multiplied by 
the swarm size (i.e., an estimate of the sum of all distances that were effectively 
covered by each robot). The performance obtained by this random behavior can 
be fitted using the interference function of eq. [7] (interpreting the interference 
function as a measurement of swarm performance without cooperation). The 
well fitted interference function and the empirically obtained data is shown in 



Fig. 3(a) labeled 'random'. The bias towards the light source (positive covered 
distance) is due to the initial positioning of the swarm closer to the wall that is 
farther away from the light source. In a second step, we keep the interference 
function fixed and fit the full model of swarm performance P (eq. [5]) to the 
data from the actual emergent taxis scenario by only varying the cooperation 
function (i.e., fitting ^ and b while keeping a 2 , c and d fixed). The fitted 
swarm performance mode{3 is shown in Fig. 3(a) labeled 'emergent taxis'. This 



simple model is capable of predictions, if the interference function has been 
fitted and we fit the cooperation function only to a small interval of, for example, 
N G [20,22] (i.e., intervals close to the maximal performance). 

Note that we are working with a single averaged value to describe the per- 
formance which does not fully catch the system's behavior. At least in some 
scenarios, as here in the emergent taxis scenario, the performance does not 
just continuously decrease due to continuously increasing interference. Instead, 
two coexisting phases of behaviors emerge: functioning swarms moving forward 



1 Fitted parameters: <n = 0.01061, b = 3.237, a 2 = 0.2138, c = -0.1823, d = 0.075 



9 



and pinned swarms with extreme numbers of u-turns. In emergent taxis this 
is shown, for example, by a histogram of barycenter speeds in Fig. |3(b)[ For 
N < 15 the mean of a unimodal distribution increases with increasing TV. Start- 
ing at about N = 15 a second phase emerges that shows slowly moving swarms 
and generates a bimodal distribution. Hence, given the fully deterministic im- 
plementation of our simulation, there are two classes of initial states (robot 
positions and orientations) that determine the two extremes of success or total 
failure. In other scenarios the interference might truly increase continuously 
due to a qualitatively different process, such as saturation of target areas with 
robots. 



3 Universal properties of collective decisions 

In the following we investigate macroscopic models of collective decisions. One 
of the most general and at the same time simplest model of collective decisions is 
a model of only one state variable s(t) which gives the temporal evolution of the 
swarm fraction that is in favor of one of the options in a binary decision process. 
If we assume that there is no initial bias to either option (i.e., full symmetry), 
then we need a tie breaker for s = 0.5. A good choice for a tie breaker is noise 
because any real swarm will be noisy. The average change of s depending on 
itself per time (As(s)/At) is of interest. Given that the system should be able 
to converge to either of the options at a time plus having the symmetric case 
of s = 0.5 we end up with at least three zeros for As(s)/At and consequently 
at least a cubic function. Instead of developing a model, that defines such a 
function, we prefer a model that allows this function to emerge from a simple 
process. In swarms the tendency to a certain option once symmetry is overcome 
(say, for s = Q.5 + e towards s = 1) is typically a result of positive feedback. We 
define such a process depending on probabilities of positive feedback next. 



3.1 Simple model of collective decisions 

As discussed in the introduction we d e fine a n urn model that was insp i red b y 
the models of lEhrenfest and Ehrenfestl ( 1907t ) and of lEigen and Winkler ( 1993f) . 



Here we use an urn model that has optionally positive or negative feedback 
depending on the system's current state and depending on a stochastic process. 
The urn is filled with N marbles which are either red or blue. The game's 
dynamics is turn-based. First a marble is drawn with replacement followed 
by replacing a second one determined by the color of the first marble. The 
probability of drawing a blue marble is implicitly determined by the current 
number of blue marbles B(t) in the urn. The subsequent replacement of a 
second marble effects either a positive or a negative feedback. The feedback 
is determined explicitly by a probability P(s, ip) that is based on the current 
ratio s of either the blue marbles in the urn or the ratio of red marbles. For 
now we choose a sine function 

P(s, (p) = (psm(irs). (8) 
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Due to the symmetry P(s, tp) = P(l — s, tp) it is irrelevant whether s is set to the 
ratio of blue marbles or the ratio of red marbles. Later we will find that similar 
but different functions might be a better choice in certain situations (Sec. 13. 4[) . 



The constant p G [0, 1] scales the amplitude of the sine function (see Fig. 6(a) I 
and consequently defines the predominant 'sign' and the probabilities of positive 
and negative feedback. The integral P(s, tp)ds gives the overall probability 
of having positive feedback independent of s. For tp < 0.5 negative feedback is 
predominant for any s and for tp > 0.5 an interval around s = 0.5 emerges for 
which positive feedback is predominant. 

Say we draw a blue marble, we notice the color, and put it back into the 
urn. Then our model defines that with probability P(s(t), ip) a red marble will 
be replaced by a blue one (i.e., a positive feedback event because drawing a blue 
one increased the number of blue marbles) and with probability 1 — P(s(t), p) 
a blue one will be replaced by a red one (i.e., a negative feedback event because 
now drawing a blue one decreased the number of blue marbles). Hence, the 
probability P(s, ip) gives the probability of positive feedback. P(s, <p) is plotted 



for different settings of <p in Fig. 6(a) There is maximum probability for positive 



feedback for the fully symmetric case of s = 0.5 as clearly seen in Fig. 6(a) For 
s = and s = 1 we have P(s, tp) = because no positive feedback is possible 
(either all marbles are already blue or all marbles are red and therefore no blue 
one can be drawn). For tp < 0.5 the probability of positive feedback is small 
(tp < 0.5, Vs : P(s, tp) < 0.5), consequently the system is stable and kept around 
a = 0.5. 

The analogy of this model to a collective decision making scenario is the 
following. The initial drawing resembles the frequency of individual decisions 
in the swarm over time proportional to s within the turn-based model. Con- 
sequently we serialize the system dynamics and each system state B(t) has at 
most two predecessors (B(t — 1) = B(t) ± 1). The replacement of the second 
marble resembles the effect of either a swarm member convincing another one 
about its decision or being convinced of the opposite. 

Based on the above definitions the average expected change per round AB 
of blue marbles B can be calculated by summing over the four cases: drawing a 
blue or red marble, followed by positive or negative feedback, multiplied by the 
'payoff' in terms of blue marbles respectively. Using the symmetry P(s, tp) — 
P(l — s, tp) we get 

AB(s) = sP(s, tp)(+l) + s (l - P( s , <p))(-l) 

+ (1 - s)P(l - s, <p)(-l) + (1 - ,s)(l - P(l - s, tp))(+l) 
4(P(s,v)-Q.5)(s-0.5). (9) 

We defined P(s, tp) based on a trigonometric function but alternatively one 
can choose, for example, a quadratic function 

P(s,tp) =<p(l-4(s -0.5) 2 ) (10) 

yielding a cubic function 

AB(s) = 2(s -0.5) - 16(s - 0.5) 3 . (11) 
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Also note that by turning positive feedback completely off (P(s) = 0) we obtain 
the Ehrenfest urn model again (cf. eqs. [T] and [9]) and by activating maximal 
positive feedback (P(s) — 1) we obtain the Eigen urn model (cf. eq. HJ). 

While in the above definition the positive feedback probability P(s, ip) is 
explicitly given, it might be unknown in applications of this model. In these 
applications the positive feedback itself might not be measurable but maybe the 
number of observed decision revisions of the agents. We introduce the absolute 
number of observed individual decision revisions from red to blue r(,(s) over any 
given period and from blue to red r r (s). The ratio of red-to-blue revisions is 
directly related to the expected average change per round 

''' ,( ' S| = 0.5(AB(s) + 1). (12) 



n{s) + r r (s) 

Based on ^t^~ ^ an d by considering the above mentioned four cases (drawing 
a blue or red marble, followed by positive or negative feedback) the following 
equation holds for red-to-blue revisions 

P(.), + (i-P(i-,))(i- 3) = - 5 r^, as, 

which yields (using symmetries) 

p(s) _ + for . 0.5, ^£1^ < m ,x(», 1 - .,, ,14) 

an equation which is to be used to determine the positive feedback probabil- 
ity based on measurements of agents' revisions. The pole at s = 0.5 and the 
consequently undefined P(s = 0.5) is reasonable. Any definition of P(s = 0.5) 
would be without effect to the system because negative and positive feedback 
are indistinguishable at s = 0.5. We actually define P(s = 0.5) in eq.0 which is 
however only a simplification of notation. Also note that this mathematical dif- 
ficulty has limited effect in applications because swarms are inherently discrete. 
For example, the exact s = 0.5 does not exist for odd swarm sizes. The squares 



in Fig. 6(a) give the redetermined P(s) based on eq. [Ml and on measurements 
of rfc(s) and r r (s). The values for P(s = 0.5) cannot be given, as discussed, and 
also values in the vicinity of s = 0.5 show discrepancies because they are close 
to the pole of eq. Q3J 



In Fig. 6(b) we compare the theoretical average change per round AB accord- 
ing to eq. [9] to the empirically obtained average change AB in terms of number 
of marbles for the different settings of (p. The agreement between theory and 
empiric data is close to perfect as expected. Two zeros si and S2 emerge addi- 
tionally to s = 0.5 for ip > 0.5: Si = ^ arcsin(^) and s 2 = 1 — \ arcsin(^). 
Positive values of AB(s) for s < 0.5 represent dynamics that has a bias towards 
s = 0.5 and negative values represent dynamics with a bias towards s = and 
vice versa for the other half (s > 0.5). 
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Fig. 6(c) gives an estimate of the asymptotic behavior of this urn model for 
varied feedback intensity <p. It shows a pitchfork bifurcation at tp = 0.5 which is 
to be expected based on Fig. |6(b)| and which is fuzzy because it is a stochastic 
process. For ip > 0.5 the curve defined by AB(s) becomes cubic and generates 
two new stable fixed points while the former at s = 0.5 becomes unstable. 



3.2 Optional extension of the model 

Since positive and negative feedback is determined stochastically based on the 
current global state in the urn model, it is straight forward to determine the 
number of replaced marbles stochastically, too. Instead of having a fixed 'payoff' 
(number of replaced marbles) of ±1 we can, for example, define a probability M 
of the event 'having a payoff of ±1' and a probability 1 — M for a payoff of 0. 
Thus the average payoff would be ±M. In addition, the payoff can be variant 
depending on the current global state s, that is, we define a function M(s). It 
turns out that a definition similar to the positive feedback probability P(s, ip) 
is useful 

M(s) = ci sin(7rs) + C2, (15) 

for appropriately chosen constants C\ and c<i. M(s) defines the average over 
absolute values of changes in s, similar to the diffusion coefficient in Fokker- 
Planck theo ry. Measurements of the diffusion c oeffic ient in swarm systems were 
reported in ( Yates et al. , 20091 : Hamann et al. , 201dl ) which show low values for 



s <C 0.5 and s ;§> 0.5 with a peak at s = 0.5. With M(s) defined by eq. [TBI we 
have symmetry again (M(s) = M(l — s)) and hence the extension of eq. [§]is 
simply 

AB(s) = 4M(s)(P(s, ip) - 0.5) (s - 0.5). (16) 



3.3 Available and unavailable methods 

Note that the recurrence B t = B t -\ + AB(B t -\) is a trigonometric or cubic 
function based on eq. [5] or eq. 1101 respectively. Thus, it is much more diffi- 
cult to be analytically handled and a concise result as for the Ehrenfest model 
(eq.[3]) cannot be obtained. When applying nonlinear equations for AB we enter 
the domains of nonlinear dynamics and chaotic systems with all their known 
mathematical intractabilities. Hence, in general we have to rely on numerical 
methods. 

However, if we choose to investigate probability distributions p(s, t) instead 
of single trajectories s(t), an interesting mathematical option is available. The 
steady state ir(s) of the probability distribution p{s,t) over an ensemble of 
realizations of s(t) can be obtained analytically. We assume for simplicity that 
the current state of the collective decision system changes every step by exactly 
one marble. The process defined by the urn model is memoryless, that is, it 
has Markov property and we can define a Markov chain with N + 1 states. We 
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define the transition matrix T of the Markov chain by 



T i+1 , i= i ( AB ( + 1 ) , for ie [2,7V], 



2 v V N 

Ti-i,t = 1 - \ (as (r^r) + x ) . for 1 e [ 2 . 

T wl = l-i(As(^)+l). (17) 

The steady state is then computed by determining the eigenvectors v 

Tv = Av. (18) 

Generally there are several eigenvectors but only one that has no changing sign 
in its elements (all positive or all negative) which represents the equilibrium 
distribution of the Markov chain 7r or —n respectively. 

With the steady state at hand, several methods of statis tical analysis ar e 



available. For example, we can calculate splitting probabilities (|Gardinerl 119851 ) . 
The splitting probability a a ,b(x) gives the probability that the system initialized 
at s = x will reach the state s — b before the state s = a. It is calculated by 

<r*,b{x) = n{s)- l ds ^jf tt(s)- 1 ^ . (19) 

We set a and b to the positions of the two peaks in the steady state probability 
density 7r that we obtain for tp = 1. Results for varied positive feedback inten- 
sity tp (while keeping a and b constant) are shown in Fig. [4] It is clearly seen 
that for tp < 0.5 there is a wide interval with a fifty-fifty chance to end at either 
of the two peaks. Beginning with tp = 0.5 and with increasing tp the probability 
of switching from one peak to the other is decreasing considerably which is an 
indicator for an effective collective decision. 

Another statistical property of Markov processes, especially those with bistable 
potentials, is the mean first passage time. Of interest is the mean time to switch 
from the collective decision for option A (s — s x with AB(s 1 ) = 0) to option B 
(s = s 2 with A£>(s 2 ) = 0). The switching time is of particular interest in the 
context of swarm intelligence because it tells w hether a swarm is able to stay 
for a considerable time in a given state (e.g., see Yates et al. ( 20091) ). In case of 



collective motion the mean switching time describes whether the swarm will be 
aligned long enough to cover a considerable distance. 

Markov theory allows to calculate the mean first passage time using the 
transition matrix T and specifying a target state i which is defined to be ab- 
sorbing (T^j = 1), that is, we convert the system into an absorbing Markov 
chain. Using the fundamental matrix M = (I — Q) _1 (with identity matrix I 
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Figure 4: Splitting probabilities cr a ,b between the two peaks in the steady state 
probability density ir for varied positive feedback intensity £ {0.1,0.2,...,!}. 



and Q is the transition matrix of transitional states) a vector of mean first pas- 
sage times for all transitional state s is ob tained by Mc with c is a column vector 
of all l's, see Grinstca d and Snell (Il997h . In addition an estimate of the mean 
switching time can be determined numerically in the urn model by generating 
trajectories s(t) and observing the switching behavior. This estimate is a lower 
bound (finite simulation time, finite number of samples, power-law distribution 
of first passage times). A comparison between theory and simulation along with 
fitted functions (a exp(bN)(cN) d ) is shown in Fig.[SJ The switching time scales 
approximately exponentially with swarm size N. 



3.4 Examples 

Next we want to compare the data from our urn model (Fig. 6(b) |) to data from 
more complex models, such as the density classification scenario (|Hamann and Worn 
20071 ) . In the following we apply the more general definition of AB (cq. IT6| but 
with an invariant payoff M (s) = c for a scaling constant c that scales the average 
change for different payoffs. 

The density classification scenario ( Hamann and Worn . 20071) is about a 
swarm of red and green agents moving around randomly. Their only interaction 
is constantly keeping track of those agents' colors they bump into. Once an 
agent has seen five agents of either color it changes its own color to that it has 
encountered most. Here, s gives the ratio of red agents. The name of this sce- 
nario is due to the idea that the swarm should converge to that color that was 
initially superior in numbers. It turns out that the averaged change As(s)/At 
(see Fig. |6(d)| starts with a curve similar to that of <p = in Fig. 6(b) and then 
converges to a curve that is similar to that of ip = 0.75. That is, As(s)/At is 
time-variant and, for example, the above mentioned Markov model would have 
to be extended to a time-inhomogeneous Markov model to achieve a better 
correlation with the measurements. Early in the simulation there is mostly neg- 
ative feedback forcing values close to s — 0.5. With increasing time the negative 
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Figure 5: Mean first passage times T between the two collective decisions S\ 
and S2 with AB(s\) = AB(s2) = over swarm size N, squares give values 
according to Markov theory, circles give measurements of the urn model, error 
bars give standard deviations, lines are fitted functions aexp(bN)N d . 



feedback decreases which results finally in positive feedback for s £ [0.23, 0.77]. 
Comparing Fig. |6(b)| to Fig. |6(d)| indicates a good qualitative agreement be- 
tween our urn model and the density classification scenario. Given that the 
curves in Fig. |6(d)| converge over time to the final shape which is resembled 
by our model for increasing tp in Fig. |6(b) one can say that positive feedback 



builds up slowly over time in the density classification scenario. By fitting 
cq. \W\ to the data shown in Fig. |6(d)| we get estimates for the feedback in- 
tensity tp. From the earliest and steepest line to the latest and only curve 
with positive slope in s — 0.5 we get values of tp £ [0, 0, 0, 0.007, 0.304, 0.603] for 
times t £ [100, 200, 400, 800,1 600, 3200]. By continuing this fitting for additional 
data not shown in Fig. |6(d)| we are able to investigate the temporal evolution 



of feedback intensity tp according to our model. In Fig. 6(e) the data points of 
feedback intensity tp obtained by fitting are shown along with a function that 
was fitted to the data. This result supports the assumption of a 'negative ex- 
ponential' increase o ver time (1 — exp(— t) ) of positive feedback in this system 
as already stated in ( Hamann et all [20101 ) . 



The positive feedback probability P(s) can also be measured via the observed 
decision revisions according to eq. 1141 This was done for the density classification 



scenario; the results are given in Fig. 7(a) which shows the measured positive 



feedback probability P(s) for time t — 7900. The data was fitted using the 
following axially symmetric function 

- -rr—) , for s < 0.5 

/>(.*) = < 1+c ^\ " , (20) 

l+c 2 (l-s) ) ' elSe 

for constants C\ and C2, which turns out to be a better fit here than the sine 
function. Once the function is fitted it can be used to calculate the expected 
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average change As following eq. \TE\ (only scaling constant c needs to be fitted). 
As seen in Fig. |7(b)| we obtain a very good fit. Indeed our experience shows that 
the procedure of fitting P(s) instead of As directly is much more accurate and 
needs fewer samples. Especially it places the zeros As = more accurate which 
are important to predict the correct steady state of the system. Even with just 
100 samples good fits were obtained which indicates that this model could be 
applied to data from natural swarm systems. The prediction of the steady state 
based on the measured positive feedback probability following cqs.[Tni[Hl andlT%l 



is shown in Fig. 7(c) In comparison to the measurements from simulation this 
prediction shows a reasonable agreement. 

Other example s showing si milarities to the <p = 0.75-graph in Fig. |6(bj| are 
Figs. 2B and 3B in lYates et al. I (|2009h which show the drift coefficient dependent 
on the current alignment of a swarm (averag e velocity). While the data obtained 
from experiments with locusts, Fig. 2B in ( Yates et all 20091) . is too noisy, we 

l2009h 



use the data from their model, Fig. 3B in ([Yates et all 12009) . to fit our model 



The result is shown in Fig. |6(f)[ We obtain a maximal positive feedback of (p = 1. 

4 Discussion and Conclusion 

We have reported two abstract swarm models with high generality due to our 
long-term objective of creating a swarm calculus. The first model describes the 
dependency of swarm performance on swarm density by separation into two 
parts: cooperation and interference. It explains the existence of an optimal or 
critical swarm density at which the peak performance is reached. The second 
model describes the dynamics of collective decision processes based on the exis- 
tence and intensity of feedback. It explains how the cubic functions of decision 
revision emerge by an increase of positive feedback over time. 

The first model is simple and the existence of optimal swarm densities is well 
known. However, to the authors knowledge, no similar model combined with 
a validation by fitting the model to data from divers e swarm applicati o ns wa s 
reported before except for the hypotheses stated by l0stergaard et al ( 2001 ). 



Despite its simplicity the model has the capability to give predictions of swarm 
performance, especially, if the available data, to which it is fitted, includes an 
interval around the optimal density. That way this model might serve as a swarm 
calculus of swarm performance. In addition, we want to draw attention to the 
problem of masking special density-dependent properties by only investigating 
the mean performance. The example shown in Fig. |3(b)] documents the existence 
of phases in swarm systems. 

The second model is also abstract but has a higher complexity and is more 
conclusive because it allows for mathematical derivations. Based on our urn 
model for positive feedback decision processes the emerging cubic function of 
decision revision can be derived (see eq. ^ . Hence, we generate the function of 
decision revision based on our urn model whic h allows for an int erpretation of 
how the function emerges while, for example, in ( Yates et al. , 20091) this function 



is measured in a local model. Our model of collective decisions might qualify as 
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(b) Average change of B(t) in terms of mar- 
bles, lines according to eq. [9] squares give 
empirical data, number of samples is 8 X 10 s 
for each possible s, 64 marbles. 
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(c) Normalized histogram of blue marbles B 
over intensity of feedback ip after t = 200 
steps, initialized to -B(O) £ {32, 33}, indicat- 
ing a pitchfork bifurcation at <p = 0.5. 
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(e) Negative exponential function ip(t) = 
0.786 - exp(-5 X 10~ 4 i) fitted to feedback 
intensities obtained from the density classi- 
fication scenario. 
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(d) Density classificatio n sce- 
nario (Ha mann and W orn. l2007fl . change of 
the ratio of red robots for different times 
during sim ulation, squares give empirical 
data (from (Hamann ct al., 2010)), lines are 
fitted according to eq. 1161 
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(f) Model fitted t o data from Fig. 3B of 
Yates et al. (2009) (local model of swarm 
alignment in locusts) by ip = 1 (and c = 
4.134 X 10 -3 ); data scaled to s £ [0, 1]. 



Figure 6: Settings of the positive feedback probabilities, resulting average 
change in B{t) in the urn model over the ratio of marbles s, histogram of blue 
marbles for varied intensity of feed back (p, comparison of mod el and results from 
the density classification scenario (|Hamann and Wornl . I2007T ) , incr ease of posi- 



tive fe edback over time, and comparison of model and results from lYates et al 
(120091) . lg 
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(a) Positive feedback probability as mea- 
sured in the simulation according to 
eq. I14I (squares) and fitted function of 
eq. [20] (line). 



(b) Predicted average change based on the 
measurements shown in (a) and eg, 1 161 (line) 
compared to direct measurements in simula- 
tion (squares). 




(c) Predicted steady state obtained by us- 
ing the measurements shown in (a) and by 
applying eqs. 1161 1171 and 1181 (line); direct 
measurement in simulation (squares). 



Figure 7: Measured positive feedback probability, predicted and measured av- 
erage change As, and predicted and measured steady state for the density clas- 
sification scenario. 
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a part of swarm calculus because those decision revision functions seem to be a 
general phenomenon in swarms. 

This model can also be used to predict probability density functions of steady 
states in swarm systems. The workflow of measuring the positive feedback prob- 
ability P(s), fitting a function, using this function to calculate the expected 
average change As, which can in turn be used to predict the expected proba- 
bility density function of the steady state (eigenvectors of transition matrix), is 
accurate with comparatively small sample numbers. 

A model o f notable similarity was published in the context of 'sociophysics' 
( Galaml . |2004) . It is based on the assumption that subgroups form in collective 
decisions within which a majority rule determines the subgroup's decision. The 
addition of contrarians, that is voters always voting against the current majority 
decision, generates dynamics that are similar to the reported observations in 
noisy swarms. Galam's model is, however, focused on constant sizes of these 
subgroups and their combinatorics while in swarm intelligence these groups and 
their sizes are dynamic. 

An interesting result is also the particular function of the positive feedback 
increase over time (1 — exp(— t)) in the density classification task (see Fig. 6(e) ). 
Note that this increase seems to be independent from respective values of s. 
Furthermore, values close to the bounds (s « 1 or s « 0) are not observed. 
An investigation of the underlying processes is beyond this paper but we want 
to state two ideas. First, the final saturation phase (lim t _ ) . 00 cp — 0.8) is most 
likely caused by explicit noise in the simulation. The agent-agent recognition 
rate was set to 0.8 which keeps P(s, ip) small. Second, the initial fast increase 
of <p (after a transient which might also be caused by the simulation because 
agents revise their color only after a minimum of five agent-agent encounters) 
might be caused by locally emerging sub-groups of homogeneous color within 
small areas that generate 'islands' of early positive feedback. These properties 
might, however, be highly stochastic and difficult to measure. Time- variant pos- 
itive fe edback was also obse rved in BEECLUST-controlled swarms as reported 
before ( Hamann et al.l . l2012|) . Hence, a feedback system as given in Fig. [5] seems 
to be a rather common situation in swarm systems. In terms of the above urn 
model we can mimic this situation by saying A in Fig. [5] is the number of blue 
marbles (w.l.o.g.), B is the probability of drawing a blue marble, P is the prob- 
ability of positive feedback (i.e., this edge can also negatively influence A), and 
C is an unspecified state variable that increases positive feedback [ip) over time 
and is influenced by an additional, unknown process. This triggers the question 
of what C can be and how it influences the feedback process independent of the 
current swarm consensus s. 

We get maximally positive feedback ip = 1 for the data of Yates et al. ( 20091 ) 
as seen in Fig. |6(f)| with the effect that situations of low a lignment (s 0.5) 
are left as fast as possible. This reinforces the findings of lYates et al. 



about the diffusion coefficient. A major feature of the self-organizing processes 
in the swarm seems to be that times in states of low alignment are minimized 
by the system (Yates et al.: "A higher diffusion coefficient at lower alignments 
suggests that the locusts 'prefer' to be in a highly aligned state"). 
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Figure 8: Time- variant feedback system; here for increasing probability of pos- 
itive feedback. 

The result of this paper is that generally applicable swarm models, that 
have simple preconditions, exist. To apply the model of swarm performance, 
only a concept of swarm density is necessary and to apply the model of collective 
decisions only a consensus variable of a binary decision is necessary. Despite 
their simplicity, both models have enough explanatory power to give insights 
into swarm processes such as the interplay of cooperation and interference and 
the installation of positive feedback. Hence, we contend that it is possible to 
generate a set of models and methods of general applicability for swarm science, 
that is, to create a swarm calculus. 
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